English-Chinese Cross-Language IR Using Bilingual Dictionaries
Abstract
This report describes the English-Chinese cross-language experiments at Berkeley for the TREC-9 Cross-Language Information Retrieval track. We present a simple and effective Chinese word segmentation method and compare the cross-language retrieval performance of two bilingual dictionaries for query translation.
Similar resources
English-Chinese Cross-Language Information Retrieval using Lucene Toolkit
In this paper, we present our English-Chinese Cross-Language Information Retrieval (CLIR) system. We focus our attention on finding effective translation equivalents between English and Chinese, and improving the performance of Chinese IR. For English-Chinese CLIR, we adopt query translation as the dominant strategy, and utilize an English-Chinese bilingual dictionary as the important knowledge res...
NTCIR-5 CLIR Experiments at Oki
We participated in the SLIR, BLIR(PLIR) and MLIR subtasks of the NTCIR-5 CLIR task. Our IR system uses language models for document scoring and query expansion, and can handle four languages: Chinese, Japanese, Korean and English. The system utilizes multiple language resources (bilingual dictionaries, parallel corpora and machine translation systems). We attempted to use some techniques includ...
NTCIR-4 CLIR Experiments at Oki
We participated in SLIR, BLIR(PLIR) and MLIR subtasks at the NTCIR-4 CLIR task. Our IR system can handle queries and documents in Chinese, English and Japanese. The system utilizes multiple language resources (bilingual dictionaries, parallel corpora and machine translation systems) for query translation. We adopted the pivot language approach for C-J and J-C search using English as a pivot lan...
Phrasal Translation for English-Chinese Cross Language Information Retrieval
This paper introduces a simple and effective nonoverlapping unigram and bigram segmentation method for both monolingual Chinese and English-Chinese cross language retrieval. It also describes English-Chinese cross language retrieval experiments involving 54 topics and some 164,000 documents. The translation of English queries to Chinese is done using a Chinese-English dictionary of about 120,00...
English-Chinese Cross-Language Retrieval based on a Translation Package
An inexpensive COTS translation package, augmented with a downloadable bilingual dictionary, was employed for a study of English-Chinese cross-language information retrieval (CLIR) using the query translation approach. The experimental setting involved the 170 MB Chinese collections and 54 queries of TREC and their relevance judgment, and our PIRCS bi-lingual retrieval system. With some standar...